Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 24
Filtrar
1.
Life (Basel) ; 13(12)2023 Nov 30.
Artigo em Inglês | MEDLINE | ID: mdl-38137893

RESUMO

BACKGROUND: Mobile phones, laptops, and computers have become an indispensable part of our lives in recent years. Workers may have an incorrect posture when using a computer for a prolonged period of time. Using these products with an incorrect posture can lead to neck pain. However, there are limited data on postures in real-life situations. METHODS: In this study, we used a common camera to record images of subjects carrying out three different tasks (a typing task, a gaming task, and a video-watching task) on a computer. Different artificial intelligence (AI)-based pose estimation approaches were applied to analyze the head's yaw, pitch, and roll and coordinate information of the eyes, nose, neck, and shoulders in the images. We used machine learning models such as random forest, XGBoost, logistic regression, and ensemble learning to build a model to predict whether a subject had neck pain by analyzing their posture when using the computer. RESULTS: After feature selection and adjustment of the predictive models, nested cross-validation was applied to evaluate the models and fine-tune the hyperparameters. Finally, the ensemble learning approach was utilized to construct a model via bagging, which achieved a performance with 87% accuracy, 92% precision, 80.3% recall, 95.5% specificity, and an AUROC of 0.878. CONCLUSIONS: We developed a predictive model for the identification of non-specific neck pain using 2D video images without the need for costly devices, advanced environment settings, or extra sensors. This method could provide an effective way for clinically evaluating poor posture during real-world computer usage scenarios.

2.
Front Oncol ; 12: 862326, 2022.
Artigo em Inglês | MEDLINE | ID: mdl-35795066

RESUMO

Background and Purpose: Benzimidazoles have attracted much attention over the last few decades due to their broad-spectrum pharmacological properties. Increasing evidence is showing the potential use of benzimidazoles as anti-angiogenic agents, although the mechanisms that impact angiogenesis remain to be fully defined. In this study, we aim to investigate the anti-angiogenic mechanisms of MFB, a novel 2-aminobenzimidazole derivative, to develop a novel angiogenesis inhibitor. Experimental Approach: MTT, BrdU, migration and invasion assays, and immunoblotting were employed to examine MFB's effects on vascular endothelial growth factor (VEGF)-induced endothelial cell proliferation, migration, invasion, as well as signaling molecules activation. The anti-angiogenic effects of MFB were analyzed by tube formation, aorta ring sprouting, and matrigel plug assays. We also used a mouse model of lung metastasis to determine the MFB's anti-metastatic effects. Key Results: MFB suppressed cell proliferation, migration, invasion, and endothelial tube formation of VEGF-A-stimulated human umbilical vascular endothelial cells (HUVECs) or VEGF-C-stimulated lymphatic endothelial cells (LECs). MFB suppressed VEGF-A and VEGF-C signaling in HUVECs or LECs. In addition, MFB reduced VEGF-A- or tumor cells-induced neovascularization in vivo. MFB also diminished B16F10 melanoma lung metastasis. The molecular docking results further showed that MFB may bind to VEGFR-2 rather than VEGF-A with high affinity. Conclusions and Implications: These observations indicated that MFB may target VEGF/VEGFR signaling to suppress angiogenesis and lymphangiogenesis. It also supports the role of MFB as a potential lead in developing novel agents for the treatment of angiogenesis- or lymphangiogenesis-associated diseases and cancer.

3.
Nucleic Acids Res ; 50(D1): D471-D479, 2022 01 07.
Artigo em Inglês | MEDLINE | ID: mdl-34788852

RESUMO

Protein post-translational modifications (PTMs) play an important role in different cellular processes. In view of the importance of PTMs in cellular functions and the massive data accumulated by the rapid development of mass spectrometry (MS)-based proteomics, this paper presents an update of dbPTM with over 2 777 000 PTM substrate sites obtained from existing databases and manual curation of literature, of which more than 2 235 000 entries are experimentally verified. This update has manually curated over 42 new modification types that were not included in the previous version. Due to the increasing number of studies on the mechanism of PTMs in the past few years, a great deal of upstream regulatory proteins of PTM substrate sites have been revealed. The updated dbPTM thus collates regulatory information from databases and literature, and merges them into a protein-protein interaction network. To enhance the understanding of the association between PTMs and molecular functions/cellular processes, the functional annotations of PTMs are curated and integrated into the database. In addition, the existing PTM-related resources, including annotation databases and prediction tools are also renewed. Overall, in this update, we would like to provide users with the most abundant data and comprehensive annotations on PTMs of proteins. The updated dbPTM is now freely accessible at https://awi.cuhk.edu.cn/dbPTM/.


Assuntos
Bases de Dados de Proteínas , Redes Reguladoras de Genes , Processamento de Proteína Pós-Traducional , Proteínas/metabolismo , Software , Animais , Arabidopsis/genética , Arabidopsis/metabolismo , Bactérias/genética , Bactérias/metabolismo , Humanos , Internet , Camundongos , Modelos Moleculares , Anotação de Sequência Molecular , Ligação Proteica , Conformação Proteica , Mapeamento de Interação de Proteínas , Proteínas/química , Proteínas/genética , Ratos , Saccharomyces cerevisiae/genética , Saccharomyces cerevisiae/metabolismo
4.
J Pers Med ; 11(11)2021 Oct 20.
Artigo em Inglês | MEDLINE | ID: mdl-34834399

RESUMO

The molecular heterogeneity of gene expression profiles of glioblastoma multiforme (GBM) are the most important prognostic factors for tumor recurrence and drug resistance. Thus, the aim of this study was to identify potential target genes related to temozolomide (TMZ) resistance and GBM recurrence. The genomic data of patients with GBM from The Cancer Genome Atlas (TCGA; 154 primary and 13 recurrent tumors) and a local cohort (29 primary and 4 recurrent tumors), samples from different tumor regions from a local cohort (29 tumor and 25 peritumoral regions), and Gene Expression Omnibus data (GSE84465, single-cell RNA sequencing; 3589 cells) were included in this study. Critical gene signatures were identified based an analysis of differentially expressed genes (DEGs). DEGs were further used to evaluate gene enrichment levels among primary and recurrent GBMs and different tumor regions through gene set enrichment analysis. Protein-protein interactions (PPIs) were incorporated into gene regulatory networks to identify the affected metabolic pathways. The enrichment levels of 135 genes were identified in the peritumoral regions as being risk signatures for tumor recurrence. Fourteen genes (DVL1, PRKACB, ARRB1, APC, MAPK9, CAMK2A, PRKCB, CACNA1A, ERBB4, RASGRF1, NF1, RPS6KA2, MAPK8IP2, and PPM1A) derived from the PPI network of 135 genes were upregulated and involved in the regulation of cancer stem cell (CSC) development and relevant signaling pathways (Notch, Hedgehog, Wnt, and MAPK). The single-cell data analysis results indicated that 14 key genes were mainly expressed in oligodendrocyte progenitor cells, which could produce a CSC niche in the peritumoral region. The enrichment levels of 336 genes were identified as biomarkers for evaluating TMZ resistance in the solid tumor region. Eleven genes (ARID5A, CDC42EP3, CDKN1A, FLT3, JUNB, MAP2K3, MYBPC2, RGS14, RNASEK, TBC1D30, and TXNDC11) derived from the PPI network of 336 genes were upregulated and may be associated with a high risk of TMZ resistance; these genes were identified in both the TCGA and local cohorts. Furthermore, the expression patterns of ARID5A, CDKN1A, and MAP2K3 were identical to the gene signatures of TMZ-resistant cell lines. The identified enrichment levels of the two gene sets expressed in tumor and peritumoral regions are potentially helpful for evaluating TMZ resistance in GBM. Moreover, these key genes could be used as biomarkers, potentially providing new molecular strategies for GBM treatment.

5.
Int J Nanomedicine ; 16: 5233-5246, 2021.
Artigo em Inglês | MEDLINE | ID: mdl-34366665

RESUMO

PURPOSE: Targeted superparamagnetic iron oxide (SPIO) nanoparticles are a promising tool for molecular magnetic resonance imaging (MRI) diagnosis. Lipid-coated SPIO nanoparticles have a nonfouling property that can reduce nonspecific binding to off-target cells and prevent agglomeration, making them suitable contrast agents for molecular MRI diagnosis. PD-L1 is a poor prognostic factor for patients with glioblastoma. Most recurrent glioblastomas are temozolomide resistant. Diagnostic probes targeting PD-L1 could facilitate early diagnosis and be used to predict responses to targeted PD-L1 immunotherapy in patients with primary or recurrent glioblastoma. We conjugated lipid-coated SPIO nanoparticles with PD-L1 antibodies to identify PD-L1 expression in glioblastoma or temozolomide-resistant glioblastoma by using MRI. METHODS: The synthesized PD-L1 antibody-conjugated SPIO (PDL1-SPIO) nanoparticles were characterized using dynamic light scattering, zeta potential assays, transmission electron microscopy images, Prussian blue assay, in vitro cell affinity assay, and animal MRI analysis. RESULTS: PDL1-SPIO exhibited a specific binding capacity to PD-L1 of the mouse glioblastoma cell line (GL261). The presence and quantity of PDL1-SPIO in temozolomide-resistant glioblastoma cells and tumor tissue were confirmed through Prussian blue staining and in vivo T2* map MRI, respectively. CONCLUSION: This is the first study to demonstrate that PDL1-SPIO can specifically target temozolomide-resistant glioblastoma with PD-L1 expression in the brain and can be quantified through MRI analysis, thus making it suitable for the diagnosis of PD-L1 expression in temozolomide-resistant glioblastoma in vivo.


Assuntos
Glioblastoma , Animais , Antígeno B7-H1 , Linhagem Celular Tumoral , Meios de Contraste , Compostos Férricos , Glioblastoma/diagnóstico por imagem , Glioblastoma/tratamento farmacológico , Humanos , Lipídeos , Nanopartículas Magnéticas de Óxido de Ferro , Imageamento por Ressonância Magnética , Nanopartículas de Magnetita , Camundongos , Temozolomida/farmacologia
6.
Cancers (Basel) ; 13(11)2021 May 21.
Artigo em Inglês | MEDLINE | ID: mdl-34064004

RESUMO

This study is to identify potential multiomics biomarkers for the early detection of the prognostic recurrence of PC patients. A total of 494 prostate adenocarcinoma (PRAD) patients (60-recurrent included) from the Cancer Genome Atlas (TCGA) portal were analyzed using the autoencoder model and similarity network fusion. Then, multiomics panels were constructed according to the intersected omics biomarkers identified from the two models. Six intersected omics biomarkers, TELO2, ZMYND19, miR-143, miR-378a, cg00687383 (MED4), and cg02318866 (JMJD6; METTL23), were collected for multiomics panel construction. The difference between the Kaplan-Meier curves of high and low recurrence-risk groups generated from the multiomics panel achieved p-value = 5.33 × 10-9, which is better than the former study (p-value = 5 × 10-7). Additionally, when evaluating the selected multiomics biomarkers with clinical information (Gleason score, age, and cancer stage), a high-performance prediction model was generated with C-index = 0.713, p-value = 2.97 × 10-15, and AUC = 0.789. The risk score generated from the selected multiomics biomarkers worked as an effective indicator for the prediction of PRAD recurrence. This study helps us to understand the etiology and pathways of PRAD and further benefits both patients and physicians with potential prognostic biomarkers when making clinical decisions after surgical treatment.

7.
Cancers (Basel) ; 12(10)2020 Oct 19.
Artigo em Inglês | MEDLINE | ID: mdl-33086550

RESUMO

Characterization of immunophenotypes in glioblastoma (GBM) is important for therapeutic stratification and helps predict treatment response and prognosis. Radiomics can be used to predict molecular subtypes and gene expression levels. However, whether radiomics aids immunophenotyping prediction is still unknown. In this study, to classify immunophenotypes in patients with GBM, we developed machine learning-based magnetic resonance (MR) radiomic models to evaluate the enrichment levels of four immune subsets: Cytotoxic T lymphocytes (CTLs), activated dendritic cells, regulatory T cells (Tregs), and myeloid-derived suppressor cells (MDSCs). Independent testing data and the leave-one-out cross-validation method were used to evaluate model effectiveness and model performance, respectively. We identified five immunophenotypes (G1 to G5) based on the enrichment level for the four immune subsets. G2 had the worst prognosis and comprised highly enriched MDSCs and lowly enriched CTLs. G3 had the best prognosis and comprised lowly enriched MDSCs and Tregs and highly enriched CTLs. The average accuracy of T1-weighted contrasted MR radiomics models of the enrichment level for the four immune subsets reached 79% and predicted G2, G3, and the "immune-cold" phenotype (G1) according to our radiomics models. Our radiomic immunophenotyping models feasibly characterize the immunophenotypes of GBM and can predict patient prognosis.

8.
PLoS One ; 15(4): e0231594, 2020.
Artigo em Inglês | MEDLINE | ID: mdl-32315343

RESUMO

Recurrence and poorly differentiated (grade 3 and above) and atypical cell type endometrial cancer (EC) have poor prognosis outcome. The mechanisms and characteristics of recurrence and distal metastasis of EC remain unclear. The extracellular matrix (ECM) of the reproductive tract in women undergoes extensive structural remodelling changes every month. Altered ECMs surrounding cells were believed to play crucial roles in a cancer progression. To decipher the associations between ECM and EC development, we generated a PAN-ECM Data list of 1516 genes including ECM molecules (ECMs), synthetic and degradation enzymes for ECMs, ECM receptors, and soluble molecules that regulate ECM and used RNA-Seq data from The Cancer Genome Atlas (TCGA) for the studies. The alterations of PAN-ECM genes by comparing the RNA-Seq expressions profiles of EC samples which have been grouped as tumorigenesis and metastasis group based on their pathological grading were identified. Differential analyses including functional enrichment, co-expression network, and molecular network analysis were carried out to identify the specific PAN-ECM genes that may involve in the progression of EC. Eight hundred and thirty-one and 241 PAN-ECM genes were significantly involved in tumorigenesis (p-value <1.571e-15) and metastasis (p-value <2.2e-16), respectively, whereas 140 genes were in the intersection of tumorigenesis and metastasis. Interestingly, 92 of the 140 intersecting PAN-ECM genes showed contrasting fold changes between the tumorigenesis and metastasis datasets. Enrichment analysis for the contrast PAN-ECM genes indicated pathways such as GP6 signaling, ILK signaling, and interleukin (IL)-8 signaling pathways were activated in metastasis but inhibited in tumorigenesis. The significantly activated ECM and ECM associated genes in GP6 signaling, ILK signaling, and interleukin (IL)-8 signaling pathways may play crucial roles in metastasis of EC. Our study provides a better understanding of the etiology and the progression of EC.


Assuntos
Carcinogênese/genética , Neoplasias do Endométrio/genética , Matriz Extracelular/genética , Proteínas de Neoplasias/genética , Biologia Computacional , Progressão da Doença , Neoplasias do Endométrio/patologia , Endométrio/metabolismo , Endométrio/patologia , Feminino , Regulação Neoplásica da Expressão Gênica/genética , Humanos , Metástase Neoplásica , RNA-Seq , Receptores de Superfície Celular/genética , Transdução de Sinais/genética
9.
Sci Rep ; 9(1): 16175, 2019 11 07.
Artigo em Inglês | MEDLINE | ID: mdl-31700141

RESUMO

Succinylation is a type of protein post-translational modification (PTM), which can play important roles in a variety of cellular processes. Due to an increasing number of site-specific succinylated peptides obtained from high-throughput mass spectrometry (MS), various tools have been developed for computationally identifying succinylated sites on proteins. However, most of these tools predict succinylation sites based on traditional machine learning methods. Hence, this work aimed to carry out the succinylation site prediction based on a deep learning model. The abundance of MS-verified succinylated peptides enabled the investigation of substrate site specificity of succinylation sites through sequence-based attributes, such as position-specific amino acid composition, the composition of k-spaced amino acid pairs (CKSAAP), and position-specific scoring matrix (PSSM). Additionally, the maximal dependence decomposition (MDD) was adopted to detect the substrate signatures of lysine succinylation sites by dividing all succinylated sequences into several groups with conserved substrate motifs. According to the results of ten-fold cross-validation, the deep learning model trained using PSSM and informative CKSAAP attributes can reach the best predictive performance and also perform better than traditional machine-learning methods. Moreover, an independent testing dataset that truly did not exist in the training dataset was used to compare the proposed method with six existing prediction tools. The testing dataset comprised of 218 positive and 2621 negative instances, and the proposed model could yield a promising performance with 84.40% sensitivity, 86.99% specificity, 86.79% accuracy, and an MCC value of 0.489. Finally, the proposed method has been implemented as a web-based prediction tool (CNN-SuccSite), which is now freely accessible at http://csb.cse.yzu.edu.tw/CNN-SuccSite/ .


Assuntos
Bases de Dados de Proteínas , Aprendizado Profundo , Processamento de Proteína Pós-Traducional , Proteínas , Análise de Sequência de Proteína , Ácido Succínico/metabolismo , Lisina/genética , Lisina/metabolismo , Proteínas/genética , Proteínas/metabolismo
10.
BMC Med Genomics ; 11(Suppl 7): 34, 2019 Mar 20.
Artigo em Inglês | MEDLINE | ID: mdl-30894197

RESUMO

BACKGROUND: Recent studies have proposed several gene signatures as biomarkers for different grades of gliomas from various perspectives. However, most of these genes can only be used appropriately for patients with specific grades of gliomas. METHODS: In this study, we aimed to identify survival-relevant genes shared between glioblastoma multiforme (GBM) and lower-grade glioma (LGG), which could be used as potential biomarkers to classify patients into different risk groups. Cox proportional hazard regression model (Cox model) was used to extract relative genes, and effectiveness of genes was estimated against random forest regression. Finally, risk models were constructed with logistic regression. RESULTS: We identified 104 key genes that were shared between GBM and LGG, which could be significantly correlated with patients' survival based on next-generation sequencing data obtained from The Cancer Genome Atlas for gene expression analysis. The effectiveness of these genes in the survival prediction of GBM and LGG was evaluated, and the average receiver operating characteristic curve (ROC) area under the curve values ranged from 0.7 to 0.8. Gene set enrichment analysis revealed that these genes were involved in eight significant pathways and 23 molecular functions. Moreover, the expressions of ten (CTSZ, EFEMP2, ITGA5, KDELR2, MDK, MICALL2, MAP 2 K3, PLAUR, SERPINE1, and SOCS3) of these genes were significantly higher in GBM than in LGG, and comparing their expression levels to those of the proposed control genes (TBP, IPO8, and SDHA) could have the potential capability to classify patients into high- and low- risk groups, which differ significantly in the overall survival. Signatures of candidate genes were validated, by multiple microarray datasets from Gene Expression Omnibus, to increase the robustness of using these potential prognostic factors. In both the GBM and LGG cohort study, most of the patients in the high-risk group had the IDH1 wild-type gene, and those in the low-risk group had IDH1 mutations. Moreover, most of the high-risk patients with LGG possessed a 1p/19q-noncodeletion. CONCLUSION: In this study, we identified survival relevant genes which were shared between GBM and LGG, and those enabled to classify patients into high- and low-risk groups based on expression level analysis. Both the risk groups could be correlated with the well-known genetic variants, thus suggesting their potential prognostic value in clinical application.


Assuntos
Biomarcadores Tumorais/genética , Neoplasias Encefálicas/genética , Glioblastoma/genética , Glioma/genética , Transcriptoma , Adulto , Idoso , Neoplasias Encefálicas/fisiopatologia , Estudos de Coortes , Feminino , Glioblastoma/fisiopatologia , Glioma/fisiopatologia , Humanos , Masculino , Pessoa de Meia-Idade , Prognóstico , Modelos de Riscos Proporcionais , Fatores de Risco , Análise de Sobrevida
11.
BMC Bioinformatics ; 19(Suppl 13): 384, 2019 Feb 04.
Artigo em Inglês | MEDLINE | ID: mdl-30717647

RESUMO

BACKGROUND: Glutarylation, the addition of a glutaryl group (five carbons) to a lysine residue of a protein molecule, is an important post-translational modification and plays a regulatory role in a variety of physiological and biological processes. As the number of experimentally identified glutarylated peptides increases, it becomes imperative to investigate substrate motifs to enhance the study of protein glutarylation. We carried out a bioinformatics investigation of glutarylation sites based on amino acid composition using a public database containing information on 430 non-homologous glutarylation sites. RESULTS: The TwoSampleLogo analysis indicates that positively charged and polar amino acids surrounding glutarylated sites may be associated with the specificity in substrate site of protein glutarylation. Additionally, the chi-squared test was utilized to explore the intrinsic interdependence between two positions around glutarylation sites. Further, maximal dependence decomposition (MDD), which consists of partitioning a large-scale dataset into subgroups with statistically significant amino acid conservation, was used to capture motif signatures of glutarylation sites. We considered single features, such as amino acid composition (AAC), amino acid pair composition (AAPC), and composition of k-spaced amino acid pairs (CKSAAP), as well as the effectiveness of incorporating MDD-identified substrate motifs into an integrated prediction model. Evaluation by five-fold cross-validation showed that AAC was most effective in discriminating between glutarylation and non-glutarylation sites, according to support vector machine (SVM). CONCLUSIONS: The SVM model integrating MDD-identified substrate motifs performed well, with a sensitivity of 0.677, a specificity of 0.619, an accuracy of 0.638, and a Matthews Correlation Coefficient (MCC) value of 0.28. Using an independent testing dataset (46 glutarylated and 92 non-glutarylated sites) obtained from the literature, we demonstrated that the integrated SVM model could improve the predictive performance effectively, yielding a balanced sensitivity and specificity of 0.652 and 0.739, respectively. This integrated SVM model has been implemented as a web-based system (MDDGlutar), which is now freely available at http://csb.cse.yzu.edu.tw/MDDGlutar/ .


Assuntos
Biologia Computacional/métodos , Glutaratos/metabolismo , Lisina/metabolismo , Motivos de Aminoácidos , Sequência de Aminoácidos , Animais , Bases de Dados de Proteínas , Lisina/química , Camundongos , Proteínas/química , Curva ROC , Reprodutibilidade dos Testes , Especificidade por Substrato , Máquina de Vetores de Suporte , Interface Usuário-Computador
12.
Clin Cancer Res ; 24(18): 4429-4436, 2018 09 15.
Artigo em Inglês | MEDLINE | ID: mdl-29789422

RESUMO

Purpose: The new classification announced by the World Health Organization in 2016 recognized five molecular subtypes of diffuse gliomas based on isocitrate dehydrogenase (IDH) and 1p/19q genotypes in addition to histologic phenotypes. We aim to determine whether clinical MRI can stratify these molecular subtypes to benefit the diagnosis and monitoring of gliomas.Experimental Design: The data from 456 subjects with gliomas were obtained from The Cancer Imaging Archive. Overall, 214 subjects, including 106 cases of glioblastomas and 108 cases of lower grade gliomas with preoperative MRI, survival data, histology, IDH, and 1p/19q status were included. We proposed a three-level machine-learning model based on multimodal MR radiomics to classify glioma subtypes. An independent dataset with 70 glioma subjects was further collected to verify the model performance.Results: The IDH and 1p/19q status of gliomas can be classified by radiomics and machine-learning approaches, with areas under ROC curves between 0.922 and 0.975 and accuracies between 87.7% and 96.1% estimated on the training dataset. The test on the validation dataset showed a comparable model performance with that on the training dataset, suggesting the efficacy of the trained classifiers. The classification of 5 molecular subtypes solely based on the MR phenotypes achieved an 81.8% accuracy, and a higher accuracy of 89.2% could be achieved if the histology diagnosis is available.Conclusions: The MR radiomics-based method provides a reliable alternative to determine the histology and molecular subtypes of gliomas. Clin Cancer Res; 24(18); 4429-36. ©2018 AACR.


Assuntos
Glioma/diagnóstico por imagem , Glioma/genética , Isocitrato Desidrogenase/genética , Imageamento por Ressonância Magnética , Deleção Cromossômica , Cromossomos Humanos Par 1/genética , Cromossomos Humanos Par 19/genética , Feminino , Genótipo , Glioma/classificação , Glioma/patologia , Humanos , Aprendizado de Máquina , Masculino , Pessoa de Meia-Idade , Mutação
13.
BMC Syst Biol ; 11(Suppl 7): 132, 2017 12 21.
Artigo em Inglês | MEDLINE | ID: mdl-29322920

RESUMO

BACKGROUND: Protein post-translational modification (PTM) plays an essential role in various cellular processes that modulates the physical and chemical properties, folding, conformation, stability and activity of proteins, thereby modifying the functions of proteins. The improved throughput of mass spectrometry (MS) or MS/MS technology has not only brought about a surge in proteome-scale studies, but also contributed to a fruitful list of identified PTMs. However, with the increase in the number of identified PTMs, perhaps the more crucial question is what kind of biological mechanisms these PTMs are involved in. This is particularly important in light of the fact that most protein-based pharmaceuticals deliver their therapeutic effects through some form of PTM. Yet, our understanding is still limited with respect to the local effects and frequency of PTM sites near pharmaceutical binding sites and the interfaces of protein-protein interaction (PPI). Understanding PTM's function is critical to our ability to manipulate the biological mechanisms of protein. RESULTS: In this study, to understand the regulation of protein functions by PTMs, we mapped 25,835 PTM sites to proteins with available three-dimensional (3D) structural information in the Protein Data Bank (PDB), including 1785 modified PTM sites on the 3D structure. Based on the acquired structural PTM sites, we proposed to use five properties for the structural characterization of PTM substrate sites: the spatial composition of amino acids, residues and side-chain orientations surrounding the PTM substrate sites, as well as the secondary structure, division of acidity and alkaline residues, and solvent-accessible surface area. We further mapped the structural PTM sites to the structures of drug binding and PPI sites, identifying a total of 1917 PTM sites that may affect PPI and 3951 PTM sites associated with drug-target binding. An integrated analytical platform (CruxPTM), with a variety of methods and online molecular docking tools for exploring the structural characteristics of PTMs, is presented. In addition, all tertiary structures of PTM sites on proteins can be visualized using the JSmol program. CONCLUSION: Resolving the function of PTM sites is important for understanding the role that proteins play in biological mechanisms. Our work attempted to delineate the structural correlation between PTM sites and PPI or drug-target binding. CurxPTM could help scientists narrow the scope of their PTM research and enhance the efficiency of PTM identification in the face of big proteome data. CruxPTM is now available at http://csb.cse.yzu.edu.tw/CruxPTM/ .


Assuntos
Preparações Farmacêuticas/metabolismo , Mapeamento de Interação de Proteínas , Processamento de Proteína Pós-Traducional , Proteínas/metabolismo , Humanos , Modelos Moleculares , Ligação Proteica , Estrutura Terciária de Proteína , Proteínas/química
14.
BMC Syst Biol ; 11(Suppl 7): 137, 2017 12 21.
Artigo em Inglês | MEDLINE | ID: mdl-29322938

RESUMO

BACKGROUND: Carbonylation, which takes place through oxidation of reactive oxygen species (ROS) on specific residues, is an irreversibly oxidative modification of proteins. It has been reported that the carbonylation is related to a number of metabolic or aging diseases including diabetes, chronic lung disease, Parkinson's disease, and Alzheimer's disease. Due to the lack of computational methods dedicated to exploring motif signatures of protein carbonylation sites, we were motivated to exploit an iterative statistical method to characterize and identify carbonylated sites with motif signatures. RESULTS: By manually curating experimental data from research articles, we obtained 332, 144, 135, and 140 verified substrate sites for K (lysine), R (arginine), T (threonine), and P (proline) residues, respectively, from 241 carbonylated proteins. In order to examine the informative attributes for classifying between carbonylated and non-carbonylated sites, multifarious features including composition of twenty amino acids (AAC), composition of amino acid pairs (AAPC), position-specific scoring matrix (PSSM), and positional weighted matrix (PWM) were investigated in this study. Additionally, in an attempt to explore the motif signatures of carbonylation sites, an iterative statistical method was adopted to detect statistically significant dependencies of amino acid compositions between specific positions around substrate sites. Profile hidden Markov model (HMM) was then utilized to train a predictive model from each motif signature. Moreover, based on the method of support vector machine (SVM), we adopted it to construct an integrative model by combining the values of bit scores obtained from profile HMMs. The combinatorial model could provide an enhanced performance with evenly predictive sensitivity and specificity in the evaluation of cross-validation and independent testing. CONCLUSION: This study provides a new scheme for exploring potential motif signatures at substrate sites of protein carbonylation. The usefulness of the revealed motifs in the identification of carbonylated sites is demonstrated by their effective performance in cross-validation and independent testing. Finally, these substrate motifs were adopted to build an available online resource (MDD-Carb, http://csb.cse.yzu.edu.tw/MDDCarb/ ) and are also anticipated to facilitate the study of large-scale carbonylated proteomes.


Assuntos
Modelos Moleculares , Carbonilação Proteica , Proteínas/química , Proteínas/metabolismo , Motivos de Aminoácidos , Sequência de Aminoácidos , Sítios de Ligação , Internet
15.
J Comput Aided Mol Des ; 28(1): 49-60, 2014 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-24442949

RESUMO

Machinery of pre-mRNA splicing is carried out through the interaction of RNA sequence elements and a variety of RNA splicing-related proteins (SRPs) (e.g. spliceosome and splicing factors). Alternative splicing, which is an important post-transcriptional regulation in eukaryotes, gives rise to multiple mature mRNA isoforms, which encodes proteins with functional diversities. However, the regulation of RNA splicing is not yet fully elucidated, partly because SRPs have not yet been exhaustively identified and the experimental identification is labor-intensive. Therefore, we are motivated to design a new method for identifying SRPs with their functional roles in the regulation of RNA splicing. The experimentally verified SRPs were manually curated from research articles. According to the functional annotation of Splicing Related Gene Database, the collected SRPs were further categorized into four functional groups including small nuclear Ribonucleoprotein, Splicing Factor, Splicing Regulation Factor and Novel Spliceosome Protein. The composition of amino acid pairs indicates that there are remarkable differences among four functional groups of SRPs. Then, support vector machines (SVMs) were utilized to learn the predictive models for identifying SRPs as well as their functional roles. The cross-validation evaluation presents that the SVM models trained with significant amino acid pairs and functional domains could provide a better predictive performance. In addition, the independent testing demonstrates that the proposed method could accurately identify SRPs in mammals/plants as well as effectively distinguish between SRPs and RNA-binding proteins. This investigation provides a practical means to identifying potential SRPs and a perspective for exploring the regulation of RNA splicing.


Assuntos
Processamento Alternativo/genética , Aminoácidos/química , Estrutura Terciária de Proteína , Proteínas de Ligação a RNA/química , Sequência de Aminoácidos , Splicing de RNA/genética , Saccharomyces cerevisiae , Spliceossomos/química
16.
BMC Bioinformatics ; 14 Suppl 2: S4, 2013.
Artigo em Inglês | MEDLINE | ID: mdl-23369107

RESUMO

BACKGROUND: Functional RNA molecules participate in numerous biological processes, ranging from gene regulation to protein synthesis. Analysis of functional RNA motifs and elements in RNA sequences can obtain useful information for deciphering RNA regulatory mechanisms. Our previous work, RegRNA, is widely used in the identification of regulatory motifs, and this work extends it by incorporating more comprehensive and updated data sources and analytical approaches into a new platform. METHODS AND RESULTS: An integrated web-based system, RegRNA 2.0, has been developed for comprehensively identifying the functional RNA motifs and sites in an input RNA sequence. Numerous data sources and analytical approaches are integrated, and several types of functional RNA motifs and sites can be identified by RegRNA 2.0: (i) splicing donor/acceptor sites; (ii) splicing regulatory motifs; (iii) polyadenylation sites; (iv) ribosome binding sites; (v) rho-independent terminator; (vi) motifs in mRNA 5'-untranslated region (5'UTR) and 3'UTR; (vii) AU-rich elements; (viii) C-to-U editing sites; (ix) riboswitches; (x) RNA cis-regulatory elements; (xi) transcriptional regulatory motifs; (xii) user-defined motifs; (xiii) similar functional RNA sequences; (xiv) microRNA target sites; (xv) non-coding RNA hybridization sites; (xvi) long stems; (xvii) open reading frames; (xviii) related information of an RNA sequence. User can submit an RNA sequence and obtain the predictive results through RegRNA 2.0 web page. CONCLUSIONS: RegRNA 2.0 is an easy to use web server for identifying regulatory RNA motifs and functional sites. Through its integrated user-friendly interface, user is capable of using various analytical approaches and observing results with graphical visualization conveniently. RegRNA 2.0 is now available at http://regrna2.mbc.nctu.edu.tw.


Assuntos
Motivos de Nucleotídeos , Sequências Reguladoras de Ácido Nucleico , Software , Sequência de Bases , Gráficos por Computador , Regulação da Expressão Gênica , Internet , Interface Usuário-Computador
17.
BMC Genomics ; 13 Suppl 1: S3, 2012.
Artigo em Inglês | MEDLINE | ID: mdl-22369687

RESUMO

BACKGROUND: Sequence features in promoter regions are involved in regulating gene transcription initiation. Although numerous computational methods have been developed for predicting transcriptional start sites (TSSs) or transcription factor (TF) binding sites (TFBSs), they lack annotations for do not consider some important regulatory features such as CpG islands, tandem repeats, the TATA box, CCAAT box, GC box, over-represented oligonucleotides, DNA stability, and GC content. Additionally, the combinatorial interaction of TFs regulates the gene group that is associated with same expression pattern. To investigate gene transcriptional regulation, an integrated system that annotates regulatory features in a promoter sequence and detects co-regulation of TFs in a group of genes is needed. RESULTS: This work identifies TSSs and regulatory features in a promoter sequence, and recognizes co-occurrence of cis-regulatory elements in co-expressed genes using a novel system. Three well-known TSS prediction tools are incorporated with orthologous conserved features, such as CpG islands, nucleotide composition, over-represented hexamer nucleotides, and DNA stability, to construct the novel Gene Promoter Miner (GPMiner) using a support vector machine (SVM). According to five-fold cross-validation results, the predictive sensitivity and specificity are both roughly 80%. The proposed system allows users to input a group of gene names/symbols, enabling the co-occurrence of TFBSs to be determined. Additionally, an input sequence can also be analyzed for homogeneity of experimental mammalian promoter sequences, and conserved regulatory features between homologous promoters can be observed through cross-species analysis. After identifying promoter regions, regulatory features are visualized graphically to facilitate gene promoter observations. CONCLUSIONS: The GPMiner, which has a user-friendly input/output interface, has numerous benefits in analyzing human and mouse promoters. The proposed system is freely available at http://GPMiner.mbc.nctu.edu.tw/.


Assuntos
Biologia Computacional/métodos , Sequências Reguladoras de Ácido Nucleico/genética , Algoritmos , Animais , Composição de Bases/genética , Sítios de Ligação/genética , Ilhas de CpG/genética , Bases de Dados Genéticas , Humanos , Regiões Promotoras Genéticas/genética , Software , Máquina de Vetores de Suporte , Fatores de Transcrição/genética , Fatores de Transcrição/metabolismo , Sítio de Iniciação de Transcrição/fisiologia
18.
PLoS One ; 6(11): e27567, 2011.
Artigo em Inglês | MEDLINE | ID: mdl-22110674

RESUMO

Regulation of pre-mRNA splicing is achieved through the interaction of RNA sequence elements and a variety of RNA-splicing related proteins (splicing factors). The splicing machinery in humans is not yet fully elucidated, partly because splicing factors in humans have not been exhaustively identified. Furthermore, experimental methods for splicing factor identification are time-consuming and lab-intensive. Although many computational methods have been proposed for the identification of RNA-binding proteins, there exists no development that focuses on the identification of RNA-splicing related proteins so far. Therefore, we are motivated to design a method that focuses on the identification of human splicing factors using experimentally verified splicing factors. The investigation of amino acid composition reveals that there are remarkable differences between splicing factors and non-splicing proteins. A support vector machine (SVM) is utilized to construct a predictive model, and the five-fold cross-validation evaluation indicates that the SVM model trained with amino acid composition could provide a promising accuracy (80.22%). Another basic feature, amino acid dipeptide composition, is also examined to yield a similar predictive performance to amino acid composition. In addition, this work presents that the incorporation of evolutionary information and domain information could improve the predictive performance. The constructed models have been demonstrated to effectively classify (73.65% accuracy) an independent data set of human splicing factors. The result of independent testing indicates that in silico identification could be a feasible means of conducting preliminary analyses of splicing factors and significantly reducing the number of potential targets that require further in vivo or in vitro confirmation.


Assuntos
Biologia Computacional/métodos , Evolução Molecular , Proteínas de Ligação a RNA/química , Proteínas de Ligação a RNA/metabolismo , Sequência de Aminoácidos , Dipeptídeos/química , Humanos , Dados de Sequência Molecular , Estrutura Terciária de Proteína , Proteínas de Ligação a RNA/genética , Reprodutibilidade dos Testes , Máquina de Vetores de Suporte
19.
BMC Bioinformatics ; 12: 300, 2011 Jul 26.
Artigo em Inglês | MEDLINE | ID: mdl-21791068

RESUMO

BACKGROUND: MicroRNAs (miRNAs) are small non-coding RNA molecules that are ~22-nt-long sequences capable of suppressing protein synthesis. Previous research has suggested that miRNAs regulate 30% or more of the human protein-coding genes. The aim of this work is to consider various analyzing scenarios in the identification of miRNA-target interactions, as well as to provide an integrated system that will aid in facilitating investigation on the influence of miRNA targets by alternative splicing and the biological function of miRNAs in biological pathways. RESULTS: This work presents an integrated system, miRTar, which adopts various analyzing scenarios to identify putative miRNA target sites of the gene transcripts and elucidates the biological functions of miRNAs toward their targets in biological pathways. The system has three major features. First, the prediction system is able to consider various analyzing scenarios (1 miRNA:1 gene, 1:N, N:1, N:M, all miRNAs:N genes, and N miRNAs: genes involved in a pathway) to easily identify the regulatory relationships between interesting miRNAs and their targets, in 3'UTR, 5'UTR and coding regions. Second, miRTar can analyze and highlight a group of miRNA-regulated genes that participate in particular KEGG pathways to elucidate the biological roles of miRNAs in biological pathways. Third, miRTar can provide further information for elucidating the miRNA regulation, i.e., miRNA-target interactions, affected by alternative splicing. CONCLUSIONS: In this work, we developed an integrated resource, miRTar, to enable biologists to easily identify the biological functions and regulatory relationships between a group of known/putative miRNAs and protein coding genes. miRTar is now available at http://miRTar.mbc.nctu.edu.tw/.


Assuntos
MicroRNAs/genética , RNA Mensageiro/genética , Regiões 3' não Traduzidas , Regiões 5' não Traduzidas , Regulação da Expressão Gênica , Humanos , MicroRNAs/metabolismo , Biossíntese de Proteínas
20.
J Comput Chem ; 31(15): 2759-71, 2010 Nov 30.
Artigo em Inglês | MEDLINE | ID: mdl-20839302

RESUMO

Protein acetylation, which is catalyzed by acetyltransferases, is a type of post-translational modification and crucial to numerous essential biological processes, including transcriptional regulation, apoptosis, and cytokine signaling. As the experimental identification of protein acetylation sites is time consuming and laboratory intensive, several computational approaches have been developed for identifying the candidates of experimental validation. In this work, solvent accessibility and the physicochemical properties of proteins are utilized to identify acetylated alanine, glycine, lysine, methionine, serine, and threonine. A two-stage support vector machine was applied to learn the computational models with combinations of amino acid sequences, and the accessible surface area and physicochemical properties of proteins. The predictive accuracy thus achieved is 5% to 14% higher than that of models trained using only amino acid sequences. Additionally, the substrate specificity of the acetylated site was investigated in detail with reference to the subcellular colocalization of acetyltransferases and acetylated proteins. The proposed method, N-Ace, is evaluated using independent test sets in various acetylated residues and predictive accuracies of 90% were achieved, indicating that the performance of N-Ace is comparable with that of other acetylation prediction methods. N-Ace not only provides a user-friendly input/output interface but also is a creative method for predicting protein acetylation sites. This novel analytical resource is now freely available at http://N-Ace.mbc.NCTU.edu.tw/.


Assuntos
Acetiltransferases/química , Aminoácidos/química , Processamento de Proteína Pós-Traducional , Proteínas/metabolismo , Solventes/química , Acetilação , Acetiltransferases/metabolismo , Sequência de Aminoácidos , Aminoácidos/metabolismo , Sítios de Ligação , Biocatálise , Interações Hidrofóbicas e Hidrofílicas , Ponto Isoelétrico , Proteínas/química , Termodinâmica
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...